Schema Mediation in Peer Data Management Systems

Authors

  • Alon Y. Halevy
  • Zachary G. Ives
  • Dan Suciu
  • Igor Tatarinov
Abstract

Intuitively, data management and data integration tools should be well-suited for exchanging information in a semantically meaningful way. Unfortunately, they suffer from two significant problems: they typically require a comprehensive schema design before they can be used to store or share information, and they are difficult to extend because schema evolution is heavyweight and may break backwards compatibility. As a result, many small-scale data sharing tasks are more easily facilitated by non-database-oriented tools that have little support for semantics. The goal of the peer data management system (PDMS) is to address this need: we propose the use of a decentralized, easily extensible data management architecture in which any user can contribute new data, schema information, or even mappings between other peers’ schemas. PDMSs represent a natural step beyond data integration systems, replacing their single logical schema with an interlinked collection of semantic mappings between peers’ individual schemas. This paper considers the problem of schema mediation in a PDMS. Our first contribution is a flexible language for mediating between peer schemas, which extends known data integration formalisms to our more complex architecture. We precisely characterize the complexity of query answering for our language. Next, we describe a reformulation algorithm for our language that generalizes both global-as-view and local-as-view query answering algorithms. Finally, we describe several methods for optimizing the reformulation algorithm, and an initial set of experiments studying its performance.

Similar resources

A Peer-to-Peer Based Semantic Agreement Approach for Information Systems Interoperability

This paper focuses on P2P based data management and semantic mediation. We propose an approach based on a P2P for semantic interoperability of information sources that aims to combine the advantages of semantic mediation and peer-to-peer systems. It is based on a pure P2P with super peer architecture consisting of two types of peers. The super peer contains a reference ontology, which provides ...

Schema Mediation and Query Processing in Peer

P2P Data Management Systems (PDMSs) allow the efficient sharing of data between peers with overlapping sources of information. These sources share data through semantic mappings between peers. In current systems, queries are asked over each peer’s local schema and then translated using the semantic mappings between peers. In this thesis we propose that a mediated schema can benefit PDMSs by all...

A New OWL2 Based Approach for Relational Database Description

Nowadays, the scientific community is increasingly interested in the mediation problem within Peer-to-Peer (P2P) systems and in data source migration within the semantic web. Data integration and interoperability have become a necessity to meet the need for information exchange between heterogeneous information systems. They reflect the ability of an information system to collaborate with other s...

Thematic Schema Building for Mediation-based Peer-to-Peer Architecture

Recently, mediation tools and peer-to-peer systems have allowed an important evolution for data sharing. Mediators are now mature techniques to share structured and heterogeneous data distributed through a reasonable number of nodes. Peer-to-peer architectures open new ways to build very large and dynamic networks allowing to share unstructured data as files indexed by some keywords. We propose...

Boosting a Network of Semantic Peers

In a Peer Data Management System (PDMS), semantic peers connect with each other through semantic mappings between their own schemas. Because of schema heterogeneity, which stems from peers’ autonomy in data representation, querying a PDMS implies query reformulations across semantic mappings, possibly incurring semantic degradation due to the reiterated approximations given by the traversal of l...

Publication date: 2003